Online Compression Caching
نویسندگان
چکیده
Motivated by the possibility of storing a file in a compressed format, we formulate the following class of compression caching problems. We are given a cache with a specified capacity, a certain number of compression/uncompression algorithms, and a set of files, each of which can be cached as it is or by applying one of the compression algorithms. Each compressed format of a file is specified by three parameters: encode cost, decode cost, and size. The miss penalty of a file is the cost of accessing the file if the file or any compressed format of the file is not present in the cache. The goal of a compression caching algorithm is to minimize the total cost of executing a given sequence of requests for files. We say an online algorithm is resource competitive if the algorithm is constant competitive with a constant factor resource advantage. A well-known result in the framework of competitive analysis states that the least-recently used (LRU) algorithm is resource competitive for the traditional paging problem. Since compression caching generalizes the traditional paging problem, it is natural to ask whether a resource competitive online algorithm exists or not for compression caching. In this work, we address three problems in the class of compression caching. The first problem assumes that the encode cost and decode cost associated with any format of a file are equal. For this problem we present a resource competitive online algorithm. To explore the existence of resource competitive online algorithms for compression caching with arbitrary encode costs and decode costs, we address two other natural problems in the aforementioned class, and for each of these problems, we show that there exists a non-constant lower bound on the competitive ratio of any online algorithm, even if the algorithm is given an arbitrary factor capacity blowup. Thus, we establish that there is no resource competitive algorithm for compression caching in its full generality.
منابع مشابه
Differential Compression and Optimal Caching Methods for Content-Based Image Search Systems
Compression and caching are two important issues for a large online image server. In this paper, we propose a new approach to compression by exploring image similarity in large image archives. An adaptive vector quantization (VQ) approach using content categorizations, including both the semantic level and the feature level, is developed to provide a differential compression scheme. We show tha...
متن کاملImproving the I/O Throughput for Data-Intensive Scientific Applications with Efficient Compression Mechanisms
Today’s science is generating significantly larger volume of data than before. Data compression can potentially improve application performance. However, in many scientific applications and especially in large scale parallel scientific applications, each process often just accesses parts of the data. This can result in some data that are decompressed by a process but not used. General compressi...
متن کاملURL Forwarding and Compression in Adaptive Web Caching
Web caching is generally acknowledged as an important service for alleviating focused overloads when certain web servers’ contents suddenly become popular. Cooperative caching systems are more effective than independent caches due to the larger collective backing store that cooperation creates. One such system currently being developed at UCLA, Adaptive Web Caching (AWC), uses an application-le...
متن کاملWhat, When and Where to Cache: A Unified Optimization Approach
Caching algorithms are usually described by the eviction method and analyzed using a metric of hit probability. Since contents have different importance (e.g. popularity), the utility of a high hit probability, and the cost of transmission can vary across contents. In this paper, we consider timer-based (TTL) policies across a cache network, where contents have differentiated timers over which ...
متن کاملA System for Transparent File Compression With Caching
Transparent le compression systems handle le compression and decompression of les automatically; they are thus transparent to users. The cost of many transparent le system compression schemes, however , is that le accesses require substantial processor time for compression and decompression. One approach to alleviating this problem is to cache recently accessed les (active les) in uncompressed ...
متن کامل